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This invention provides methods, apparatus, system, and 
article of manufacture which solve the problem of mismatch 
between the keywords employed by a user in making a query 
and those assigned by the manual or automatic classification 
system stored in the system's keyword index. The system 
records the initial keyword(s) input by the user and holds 
them until the user is either satisfied or gives up . If in a query 
session the user is satisfied with the object(s) retrieved from 
the repository, the system associates the initial keywords(s) 
with the retrieved object(s). This facilitates the object's 
retrieval by the same user or subsequent users who input the 
same keywords. The keyword index is modified directly 
when, for example, a single service or application controls 
the interaction between user and repository, end to end. 

39 Claims, 5 Drawing Sheets 




Index 



06/08/2004, 



EAST Version: 1.4.1 



U.S. Patent May 27, 2003 Sheet 1 of 5 US 6,571,239 Bl 



< 00 

CD 0 




06/08/2004, EAST Version: 1.4.1 



U.S. Patent May 27, 2003 Sheet 2 of 5 



US 6,571,239 Bl 




06/08/2004, EAST Version: 1.4.1 



U.S. Patent May 27, 2003 Sheet 3 of 5 



US 6,571,239 Bl 




06/08/2004, EAST Version: 1.4.1 



U.S. Patent 



May 27, 2003 



Sheet 4 of 5 



US 6,571,239 Bl 



X 
CD 



CD 
E 

o 

_C 

0 
CD 



O 
O 



o 




cd 

I 

<D 
"O 

o 
c 




CD 



E 
o 



CD CD 



X 
CD 



E 
o 

CD 

■»— • — 

J? X 

"o co 



T3 
O 



x 

"D 

C 



■D 

o| 

E O. 



06/08/2004, EAST version: 1.4.1 



U.S. Patent May 27, 2003 Sheet 5 of 5 US 6,571,239 Bl 




06/08/2004, EAST Version: 1.4.1 



US 6,571,239 Bl 
1 2 

MODIFYING A KEY- WORD LISTING BASED methods are required to update the keyword list and classi- 

ON USER RESPONSE fication scheme used in information retrieval systems. 

SUMMARY OF THE INVENTION 

HELD OF THE INVENTION s ln one me presen{ j nveDt ion provides a method, 

The present invention relates to the field of information apparatus and article of manufacture to modify the associa- 

retrieval. More specifically, it is directed to a system and tions between objects in the database and keywords in the 

method for incrementally updating and modifying a key- index, based on keywords supplied by the user during a 

word index in response to user feedback search session- These modified associations keep the clas- 

10 sification of objects in the database up to date. 

BACKGROUND OF THE INVENTION ^ of ^ prescm deflnes , m 

Existing keyword search engines for information reposi- session as an interaction of a user expressing a query with a 

tories typically have two components. The first component database of objects and associated keywords. The invention 

may be described as a system for classifying a corpus of provides a method for determining user's satisfaction with 
documents or other objects, such as images. The result of 15 me results of the search. 

this process is a set of indices or similar data structures that Other aspects and a belter understanding of the invention 

associate keywords or terms with the documents or other may be realized by referring to the Detailed Description, 
objects. The second component provides a means for a user 

of the search engine to express a query. This component . BRIEF DESCRIPTION OF THE DRAWING 
analyzes the query and uses the data structures provided by 20 These and other objects, features, and advantages of the 
the first component to provide a set of objects which are present invention will become apparent upon further con- 
deemed to be relevant to the user's query. sideration of the following detailed description of the inven- 
The classification of the information may be a manual lion when read in conjunction with the drawing figures in 
process, in which a human classifies each object. This which: 

classification may be done by one or more skilled specialists. FIG. 1 shows an example of a system implementing the 

Alternatively, it may be done by relatively less skilled present invention; 

individuals such as the authors or creators of the objects, FIG. 2 shows an example embodiment of the logic for 

who select descriptive keywords. In another alternative, the deletion of keyword-object associations in accordance with 

classification of documents may be an automatic process, the present invention 

performed by computing systems and software. This gen- p rr . * ™ u a c * .> 

erally uses a variety of means to extract a meaningful the te™J?h,^ ^ t process flow of updating 

description of each object. This includes, for example, £S7E££r m ^ the 

analyzing text in the document's tide, abstract, or body, or ^ n ' ? , c 

by automatically detecting patterns in images and so on - shoW5 cxam P lc of 311 ovcral l Sow of the system 
Wh „ f . .„ .. , 35 m accordance with the present invention. 

Whether the classification is manual or automatic, in nir , - . , 

either case the result is a keyword list with associated data ' 5 . S m exam P le embodiment of an updating 

that can be used to retrieve relevant objects for a user query. a PP aratus m accordance with the present invention. 

Again, this is accomplished in existing embodiments by a DESCRIPTION OF THE INVENTION 

variety of means. At its simplest, each word or phrase in the An Th* „r„c<,„t • • a ^ 

keyword fist points to a list of objects which were classified *° J^JTlZ*«T SFT 1 ^ % IS"™? ^ 

by that word or phrase, and that list of objects is returned 1^1!^ 5 *5 - Pr ° blem ° f 

when the query contains that word or phraie. Many varia- ^nerv tT5£ ^ A 7*1? * " ^ 

tions of this basic method have been developed, including J™? b J the """V ° T aUt ° maUC 

ranking the returned objects by analytical or statisfical 45 t^ZZ^?T,f " ^ 

properties that are intended to reflect the presumed relevance 45 L ?H C ^ ^ , L thc J™ a{ mvC f on ' thc 

or importance of the object. f C mitoal **y?™K*) wpm by the user and holds 

c . . * . . J them until the user is either satisfied or gives up. If in a query 

From the viewpoint of the invention described here, these sess ion the user is satisfied with the obtectfs) retrieved from 

t^ZL" 1 : if P0SSe T t 7 h sh0rt00min g S - J irs *y> the repository, the system associates the inK£S$ 

keywords and terms employed by a user m making a query so with the retrieved object(s). This facilitates the objec 's 

manna? ZT^l T^Z* ^ ^ ¥ f Wttfav,d by ** ^ ™ ™ who inpuUhe 

ZSfw^ f5T T claSSxfica r ff ° SySte f , ,! nd the same keywords. The assodation^new keywords Jh data 

search words of different users may differ greatly from one objects is implemented in different ways, depending on 

another. Secondly, while the sorts of queries provided by an whether modification of the master keyword index fa 
!U™ . I* T g f r ? latlvcl y ra P idl y in rcs P° nsc * allowed or desirable. Two alternative example embodiments 

to current events changing fashion, and other vagaries of are described below. Alternative Adetails a case in which the 

public mteres , the clarification scheme itself is relatively keyword index is modified directly. This is feasible for 

c tL aD th " hT BeCaUSe K° f K b0th th6Se CXam P lc ' whcn a ™& ""ice or application controls the 

factors, there can be a substantial mismatch between the interaction between user and repository, end to end. Alter! 

iSSSSTS! SyStem ^V- 0 Cla f ,fy ° bjeCtS ^ Dd that 60 ^e B details the case in which, the master keyword index 

SSrS m ,K m . 8 ^ qUeneS " ^ iS 3 ^notmodmed.ThfaisthecasewhenmeprocesTmteracti^ 
mismatch that grows with time. ^ ^ M docs not havc pcnnissioa t / ch thc mast ^ 

the result of this mismatch is that the search engine may index. This occurs when, for example, only experienced 

tail to find many of the relevant objects, and more of the librarians may have authorization to modify it In this case 

objects it does retrieve will be irrelevant to the intended 65 new keywords are stored in an auxiliary index An external 

query. Thus the user may frequently fail to find the desired process merges both the master and auxiliary indices before 

information. In order to provide better search results, new returning the results to the user. 
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An example of a system implementing the present inven- ing keywords (or Boolean expressions) with the object(s) or 

tion includes the parts shown in FIG. 1. A User Interface descriptions) of the object(s) chosen by the user. This is 

(100), for users to input queries, which may be on the same done by linking each keyword with the object's id returned 

machine as the rest of the system, is connected in a client- by the Search Engine (110) or the Auxiliary Index (140). 

server architecture or via the Internet. Users formulate s Under both alternatives, the Receiver (100) passes this 

queries in a variety of ways: freetext (or natural language), association information gathered during the session to the 

keywords (with or without Boolean expressions), by select- Updating Module (125). Under Alternative A, the Updating 

ing choices from a menu, etc. The queries are passed to a Module (125) updates the master Keyword Index (120) with 

Receiver (105). When the first query arrives from a user, the this information. Under Alternative B, it updates the Aux- 

Receiver initializes a session log, and associates the log with 10 aiar y Index (140). 

this first query. The Receiver (105) closes the log when the In a particular embodiment, the Updating Module (125) 

user inputs a response that indicates satisfaction, or when the optionally keeps statistics (130) about the session informa- 

session times out because the user has been inactive past a tion. This includes information such as a time stamp, which 

certain time limit. The Receiver (105) passes the query to the indicates how recently the keyword(s) have been used. It 

rest of the system for further processing. Under Alternative 15 often also keeps other statistics, such as the number of 

A, the Receiver passes the query to a Search Engine (110), sessions associated with keyword(s), to indicate the fre- 

which matches the query against the (modified) Keyword quency of use of the keyword. These statistics are subse- 

Index (120). The keywords in the Keyword Index (120) arc quently employed to make decisions about whether and how 

associated with one or more objects in the Repository (115). long to keep associations between keywords and data 

If the query matches one or more keywords in the Index 20 OD jects either in the master Keyword Index (120) in Alter- 

(120), the data objects associated with those keywords, or a native A, or in the Auxiliary Index (140) in Alternative B. 

description of these objects, such as a hitlist of titles, The logic for deletion of keyword-object associations 

thumbnails, etc., are retrieved from the Repository and takes various forms. One embodiment shown in FIG. 2, sorts 

returned to the Search Engine (U0). The Search Engine associations from oldest to newest and from lowest to 

(110) in turn passes them to the user interface (100). In 2 5 highest frequency. When the Index (120), or Auxiliary Index 

Alternative B, where the Keyword Index does not change, (140), reaches a maximal size determined by the application 

the Receiver (105) passes the query to the Updating Module or the constraints of the hardware, the oldest and least 

(125) besides passing it to the Search Engine (110). The frequent associations are deleted. A precise formula com- 

Updating Module (125) matches the query against the bining date and frequency, is often defined as needed 

Auxiliary Index (140), using a matching identical or similar 30 Associations that are new and frequent are generally kept in 

1°^%?^^ ^ De , by ^ Search ^ Amnluiy so much that they reflect current and common ways' of 

Index (140) includes keywords associated with data objects referring to objects. These are most beneficial to subsequent 

1^ JS 0 ? 10 '?' In 801116 cmb °diments, »c Auxiliary users. Associations that are new and infrequent arc also kept 

Index (140) has the same structure as the Keyword Index while they are new since their frequency of usage may still 

( ^ ^ . •^ tCrna(lV6 B ' b0th ^ Scarch EngiDC (U0) 35 change. Associations that are old and frequent are kept under 

and the ; Auxiliary Index (140) return the relevant data Alternative A, as part of the master Index. This is especially 

objects (or descriptions) to Rcsponder (135). The Responder so, if they are reviewed by an expert, such as a librarian 

(135) merges the two lists and passes the combined result to They are probably best removed from the Auxiliary Index 

the user interface (100). (140), under Alternative B. * 

This iterative process-query, match against keywords, 40 FIG. 3 describes an example of an updating process flow 

and return of relevant data^ontinues until the user's When a user session ends with a response indicating 

response indicates satisfaction with one or more objects satisfaction, one or more associations are made between 

• retrieved. This marks the end of a query session. Satisfaction keywords in the original user query and the objects satisfy- 

could be measured in several ways. Some level of satisfac- ing the query session. These associations are then added to 

tion is indicated if the user requests to see the object in full, 45 either the master Index or the Auxiliary Index. A determi- 

which may take time. Further satisfaction is indicated by nation is made to determine if the association exists already 

other responses. For example, satisfaction is indicated when (305). If yes, its statistics of date and usage are updated 

the user starts a transaction to pay for downloading the (325), if they are kept by the system. If the association does 

object. The end of a session can also be explicitly indicated not exist, it should be added. A determination is made to 

by a user response, in which the user marks objects as 50 determine if the index is at maximum size (310) If yes the 

relevant or irrelevant at the User Interface (100). This most "deletable" association is deleted to make room (315) 

constitutes an accurate way to measure user's satisfaction Then the new association is added (320) with its statistics 

with the objects. Unfortunately, users seldom provide this (325). 

kind of feedback even if the User Interface (100) is capable FIG. 4 describes an example showing an overall flow of 

AMk! ^ f A • tt» d ^ , 55 the& y stem ^ e ^ er sub m»ts a query (401) which is matched 

At the end of the session, the Receiver (105) analyzes the against the Keyword Index (405) and against the Auxiliary 

first query into keywords. In one embodiment, this analysis Index (410) under Alternative B. If this is the user's first 

is trivial, such that the whole query is taken to be a single query in a session (412), the query (and its statistics) is 

keyword. In an alternate embodiment the Receiver (105) stored in the Updating module (413). The matched keywords 

actually separates the query into single words and/or miilti- 60 arc used by the system to retrieve objects associated with 

word phraseS ft This depends upon how much knowledge the them (425). The objects (or their description) are then 

Receiver (100) has of the structure of the Keyword Index. If displayed to the user (430). If the user enters a response 

Boolean operators are used in the query (Clinton AND which indicates satisfaction (440), the first query stored (in 

Lewinsky, for example), these combined objects are also step 413) is parsed into keywords (445). Each keyword is 

stored. If appropriate, the Receiver breaks up a complex 65 associated with an object, and optionally, statistics of date 

Boolean query into simpler, logically equivalent Boolean and usage are updated for each association (450) Under 

expressions. The Receiver (105) then associates the result- Alternative A, the Master Index is updated with these 
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associations directly (455). Under Alternative B, the Auxil- from a user session, if the session ended in a sufficient level 

iary Index is updated (460). of satisfaction. In some embodiments, the step of removing 

If the user's response (440) does not indicate satisfaction, is repeated in accordance with a removal policy, 

this may be either because there was no response at all (435), Id ^ embodiments, the user is connected to the reposi- 

in which case the session is considered over; or it may be s tory via an Internet connection 

because the response was a further query, reiterating the ,„ ■ i j , , r 
interacuon within the same session. (401). ^ P resentlQ g invention also provides a method for 
Thus the presenting invention provides a method for J™"^ modifying a user's query. In an embodiment, 
updating a keyword index for searching a data object in a ^ raeth °d includes analyzing at least one of the queries into 
repository. The method includes providing a search engine 10 kevwords ( or combinations of query words); and matching 
for the data repository. The search engine is available for at least one 9 uer y word a 6 ainst both an auxiliary 
responding to at least one query from a user in a user session. keyword index, created from keywords of previous users' 
The data repository has an associated keyword index linking sessions and the keyword index associated with the data- 
keywords with each of a plurality of data objects. The base * T° e results of the two matches may then be combined 
method also includes the steps of: receiving at least one and presented to the user. 

query from the user in a user session; analyzing the query; 15 The dynamic modification can also include an interme- 

matching the query against the keyword index; responding diary step, where matching keywords from the auxiliary 

to the query with at least one particular data object from index can be presented to the user for approval before 

among the data objects; determining user's satisfaction with mer giag or before submitting them as a query to the search 

the search results; and updating the keyword index to form engine. 

a modified keyword list which includes at least a portion of 20 In cnm „ m u~*; ,u * e i • ■ i * 

the analyzed query linked to the at least one particular data J* T ^^Tt ' a ? ° f mcludcs 

object F 10 using the modified keyword index in a subsequent user 



_ , ,. sessions. 

In some embodiments, the method further includes i . . . A , , , „ , 

^to^^U*^*^^ ^ step ^ 

In some embodiments, the step of analyzing includes the ~ 1 tg^xWow jfflj 

br^» ^ mt i isstssrt to i " 

and/or remove words that are not relevant for search; and/or ™V° "T* 1 Chmg '\ l ° ^ maSter mdeX " 
normalize words, such as by morphological analysis or by 30 I* 16 P res f tin g invention also provides an apparatus for 

stemming; and/or substitute Boolean operators for natural- "P^g at . least onc keyword search index, searchable by a 
language words like "both", "and", "or". If the query is engm ? res P ondm g t0 at least one query from a user 

Boolean, the analysis consists of breaking up a complex f i? Kt SeSS10n " M example embodi ment is shown in FIG. 

Boolean expression into logically equivalent simplified 5> llie a PP aratus jnchides: a statistics module (130) which 

Boolean expressions 35 stores tirae ^ fre quency statistics of usage of keywords 

Thestepofmatchingconsistsofcomparingeachkeyword ™* 'ach tf a Plurality of data objects in a 

(or combination of keywords) of the analyzed query wul the daUbMC; * ^"i^AS ^ St ° re ^ at leaSt 

keywords in the keyword index. SeveL mlchmg algt .7 W ( ^ t0 t " ^ ° nC 

rithms may be used-perfect string match, partial match of TaZ a , Sc°w pU ^ a l ^ ™ d 

query portion to keyword, etc. 40 upd *?5 , modulc , 125 > to updatc thc mdcx b ? fanning a 

In some embodiments, the step of responding includes: 7,^™^ ^ v ich includes at least a portion of the 

storing all queries of a u£r session'; anal/ig at feast one of J^LE^S ^ ' °" 

45 pri ^TcLhf f odi T ents V m t TT m ,he recciV H/ C T 

words; and determining the at least one particular data object 45 E^^SL? St °T g *Z* l ca f fi On^ query; and/or the 

based on the result of the step of matching. Sometimes, the "£5£ ?° empl T me mod ? e * ""J"™? bst « a 

step of responding further includes sending the userTfirst ""^Si 1 cmbodimcQ ^ c rccc ™ 

description of the at least one particular dfu object. I 'TT^f^ * £5? * 

t„ ■ i • , lcast oac query, and/or a query analyzing module (520} to 

J«fZl^ T» ^ m l h ° d ^ 1DdudCS a0aly ' 50 break »P ^ q uer y into wo'ds, keywordsor simpler Boolean 

^ ( T?r P ff7 teSySt r r S nSein<)rder expressions. In some embodimcms the updaSJ moduk 

to detect a first level of sanction and/or higher levels of ^tes a matching module (512) to match the aUe^one 

user satisfaction; and/or the step of determining complete auerv (he kp jL rrk , nH . n JzSZ^Z . 

user satisfaction, which indicates the end of a uL session. ^^daU 

In some embodiments the step of updating occurs only if 55 some embodiments, the responder includes a merging mod- 

a sufficient level of user s satisfaction was determined from u le (514), that merges the results from the search engine 

the user s response. Updating includes updating statistics for based on the keyword index with results from the auxiliary 

T T, f ,° f ^ at - u'u ° ne , SeSSi ° D ' ^ bascd on ^ updating module data. Sometimes, the 

wherein the at least one word is linked with the at least one responder (135) sends the user a first description of the at 

particular data object; and/or the statistics include frequency 60 least onc particular data object; and/or the receiver receives 

of utflization; and/or the statistics include a time stamp; a user resp onse indicating a first level of satisfaction; and/or 

and/or the statistics include a time stamp and further mc ICS pondcr (135) satisfies thc at least one query bascd 

includes the step of employing the statistics for updating the upon the user response; and/or the receiver includes a 

w™ h . md ^ ; b and/or J hc / lc P K of amoving at least one detector (515) to detect a session terminating response to the 

keyword m the keyword index based on the statics. 65 at l east cue user query; and/or the updating module updates 

In some embodiments, the step of updating includes the statistics for at least one word in each query of the at least 

step of adding to the keyword index at least one keyword one query, wherein the at least one word is linked with the 
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at least one particular data object; and/or the words in at least skilled in the art that modifications to the disclosed embodi- 

one query of the at least one query form a Boolean Expres- ments can be effected without departing from the spirit and 

siom and/or the statistics includes frequency of utilization scope of the invention. The described embodiments ought to 

and/or includes a time stamp; and/or the updating module oe construed to be merely illustrative of some of the more 

employs the statistics to update the keyword index; and/or s prominent features and applications of the invention. Other 

the updating module removes at least one keyword in the beneficial results can be realized by applying the disclosed 

keyword index based on the statistics; and/or the updating invention in a different manner or modifying the invention 

module adds at least ooe keyword to the keyword index; m wavs known to those familiar with the art. 

and/or the updating module employs a removal policy; Having thus described our invention, what we claim as 

and/or the user is connected to the repository via an Internet to ncw and dcsirc to sccurc b * 1x11613 Patent b * follows 

connection; and/or the terminating response indicates the L ~ met 5° d c PPP r ^g updating a keyword index for 

user's satisfaction of the results of the first session. searching a data object in a repository, including the steps of: 

me presenting invention also provides an article of ^.^^.^T^^ 
manufacture comprising a computer usable medium having query from a ^ m a m sessi saj § daU reposito 
computer readable program code means embodied therein 15 having m assodated keyword index k £ / s 
for causing an updating of a keyword index, the computer with each of a plurality of data objects- 
readable program code means in the article of manufacture rcccivin ^ at lcasl onc from ^ 
comprising computer readable program code means for >• , • , . , 

causing a computer to effect any method of the present TS^I <t ^ ™*f ^ °°* 

invention 20 particular data object from among said data objects; 

ar2 C J^T g inVCnti ° n ^° r Vid S 3 C °, mpUt u r Pr °' u P datm g said M«l *d« ^ form a modified keyword 

cZZ I =t , C ° h 7 nS1Dg 1 C °T ter h u Vmg list which iDcludcs at lcast a P^on of said at IcaWone 

computer readable program code means embodied therein query in keywords ^ to at , eas| 

for causing an updating of a keyword index, the computer data object p«""um 

n C r nH?^n Pr ° gram ^ * J* "mputer piognun 25 2 . A method "as recited in claim 1, further comprising 

for £?. P g Tf * feadable P m Z™ m ™ dt meaos employing the modified keyword list in a subsequent user 
for causing a computer to effect any method of the present 1 se5 sj on \ 4 

invention. 3. A method as recited in claim 1, wherein the step of 

Ihe presentmg invention also provides a program storage responding includes- 

of I^, d . able ^ Tm 1 k ' r gibly , emb ° d y^ a P 10 ^ storing all queries of said at least one query session; 
of instructions executable by the machine to perform method anai^™ th» a • . n 

steps for dynamically modifying an initial search of a search cxSions S1D1P 

engine, the method steps comprising the steps of any method , f. . . ' . 

of the present invenUon " matching said at least one query with said keywords; and 

The present invention can be realized in hardware 35 ****** said at least one particular data object based 

software, or a combination of hardware and software. A 4 7 f P °, ^t^u ■ u 

visualization tool according to the present invention can be T .LtZ! I " ?! Wh< f m ^ Step ° f 

realized in a centralized fashion in one computer system, or EES?* ? I' , < ? n, ? nscs "f 1 "?* ? c user a first 

in a distributed fashion where different elements are spread ^1 ZhlTl ^ ° n ^ a ^ hr data ob J ect - 

across several interconnected computer systems. Any kind " JLi£^J? "ST 4 J ^'."l ^ Step ° f 

of computer system-or other apparatus adapted for carry- 3SS n 2r ^termmmg said user's first 

ing out the methods described herein-is suited. A typical kvd of saUsfactlon wth the searc h «sults by analyzing the 

combination of hardware and software could be a general USt J ?/ „ . J . , . 

purpose computer system with a computer program that, 4 - J nn t * 10 ^ ?' wh ?^ em *" Step ° f 

when being loaded and executed, controls Ihe computer 45 res f "^S^er compnscs satisfying said user response, 

system such that it carries out the methods described herein ' J* ^t?* recited ,n claim 6 > wherein *e step of 

The present invention can also be embedded in a computer res P ondm e ™« comprises determining complete satis- 

program product, which comprises all the features enabling '^L^ T', mA ^ u *^ ° nd f a ^ ™?™> 

the implementation of the methods described herein, and m « I°*TT dctCrm ^ mg f 8 . cnd of a W SC5Slon - 

which-when loaded in a computer system-is able to carry S ° * V £ m ° laim 7 ' wherem me ste P of 

out these methods. responding further comprises detecting a terminal success to 

_ said at least one user query. 

Computer program means or computer program in the 9. A method as recited in claim 1, where the step of 

or notahon, of a set of instructions intended to cause a 55 each query of said at least one query, wherein said at least 
system having an information processing capability to per- one wor / is Unked wilh said a T le ^ t one par ^ ^ 
form a particular funcUon either directly or after either or object F 

codt t TJ^^rt^T^ 10 ^f er f Iang T 6e ,' 10 - A mcthod M rccited in claim 9 ' ^rein words in at 
form reproduction in a different material least one query of said at least one query form a Boolean 

60 Expression. 

It is noted that the foregoing has outlined some of the U. A method as recited in claim 9, wherein the statistics 

more pertinent objects and embodiments of the present include frequency of utilization. 

invention. This invention may be used for many image or 12. A method as in claim 9, wherein the statistics include 

image -like applications. Thus, although the description is a time stamp. 

made for particular arrangements and methods, the intent 65 13. A method as recited in claim 9, wherein the statistics 

and concept of the invenUon is suitable and applicable to include a time stamp and further comprises employing the 

other arrangements and appUcations.lt will be clear to those statistics for updating said keyword index 
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14. A method as recited in claim 9, further comprising 
removing at least one keyword in said keyword index based 
on said statistics. 

15. A method as recited in claim 14, wherein the step of 
removing is repeated in accordance with a removal policy. 

16. A method as recited in claim 9, further comprising 
adding at least one keyword in said keyword index based on 
said statistics. 

17. A method as recited in claim 1, wherein the user is 
connected to the repository via an Internet connection. 

18. An apparatus for updating at least one keyword search 
index, searchable by a search engine responding to at least 
one query from a user in a first session, said apparatus 
comprising: 

a statistics module which stores time and frequency 



10 



10 



32. An apparatus as recited in claim 27, wherein the 
updating module removes at least one keyword in said 
keyword index based on said statistics. 

33. An apparatus as recited in claim 32, wherein the 
updating module employs a removal policy. 

34. An apparatus as recited in claim 32, wherein the 
updating modules adds at least one keyword in said keyword 
index based on said statistics. 

35. An apparatus as recited in claim 18, wherein the user 
is connected to the repository via an Internet connection. 

36. An article of manufacture comprising a computer 
usable medium having computer readable program code 
means embodied therein for causing an updating of a 
keyword index, the computer readable program code means 



. . a j, ■ 7 . . . — - ^;" U1U uuuipuici icduamc program coae means 
statistics of usage of keywords associated with each of 15 m said arlicle of manufacture comprising computer readable 
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a plurality of data objects in a database; 
a receiver to receive and store said at least one query; 
a rcsponder to respond to said at least one query with at 

least one particular data object; and 
an updating module to update said index by forming a 

modified keyword list which includes at least a portion 

of said at least one query in keywords linked to said at 

least one particular data object. 

19. An apparatus as recited in claim 18, wherein the 
receiver comprises a cache for storing said at least one query. 

20. An apparatus as reciled in claim 18, wherein the 
updating module employs the modified keyword list in a 
subsequent user session. 

21. An apparatus as recited in claim 18, wherein the 
receiver includes: 

a memory to store queries included in said at least one 
query; 

a query analyzer to analyze the query into keywords or 
simpler Boolean expressions; 

a matching module to match said at least one query with 
said keywords and to determine said at least one 
particular data object based on the results of the match. 

22. An apparatus as recited in claim 21, wherein the 
responder sends the user a first description of said at least 
one particular data object. 

23. An apparatus as recited in claim 22, wherein the 
responder comprises a merging module to merge the results 
obtained from the search engine with the results obtained by 
matching the user's query against keywords in an auxiliary 
index. 

24. An apparatus as recited in claim 23, wherein the 45 
receiver receives a user response indicating a first level of 
satisfaction. 

25. An apparatus as recited in claim 24, wherein the 
responder satisfies said at least one query based upon said 



program code means for causing a computer to effect: 
providing a search engine for the data repository, said 
search engine available for responding to at least one 
query from a user in a user session, said data repository 
having an associated keyword index linking keywords 
for each of a plurality of data objects; 
receiving said at least one query from said user; 
responding to said at least one query with at least one 
particular data object from among said data objects; 
and 

updating said keyword index to form a modified keyword 
list which includes at least a portion of said at least one 
query in keywords linked to said at least one particular 
data object. 

37. An article of manufacture as recited in claim 36, the 
computer readable program code means in said article of 
manufacture further comprising computer readable program 
code means for causing a computer to effect employing the 
modified keyword fist for a subsequent user session. 

38. An article of manufacture as reciled in claim 36, 
wherein the step of responding includes: 

storing all queries of said at least one query session; 
analyzing the query into keywords and/or simple Boolean 
expressions; 

matching said at least one query with said keywords; and 
determining said at least one particular data object based 
on the result of the step of matching. 

39. A computer program product comprising a computer 
usable medium having computer readable program code 
means embodied therein for causing an updating of a 
keyword index, the computer readable program code means 
in said computer program product comprising computer 
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26. An apparatus as recited in claim 25, wherein the re * dable program code means for causing a computer to 
responder includes a detector to detect a session terminating 



response to said at least one user query. 

27. An apparatus as recited in claim 18, where the 
updating module updates statistics for at least one word in 
each query of said at least one query, wherein said at least 
one word is linked with said at least one particular data 
object. 

28. An apparatus as recited in claim 27, wherein words in 
at least one query of said at least one query form a Boolean 
Expression. 

29. An apparatus as recited in claim 27, wherein the 
statistics includes frequency of utilization. 

30. An apparatus as recited in claim 27, wherein the 
statistics includes a time stamp. 

31. An apparatus as recited in claim 27, wherein the 
statistics includes a time stamp and the updating module 
employs the statistics to update said keyword index. 



55 



60 



65 



providing a search engine for the data repository, said 
search engine available for responding to at least one 
query from a user in a user session, said data repository 
having an associated keyword index linking keywords 
for each of a plurality of data objects; 
receiving said at least one query from said user; 
responding to said at least one query with at least one 
particular data object from among said data objects; 
and 

updating said keyword index to form a modified keyword 
list which includes at least a portion of said at least one 
query in keywords linked to said at least one particular 
data object. 
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